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1) Introduction 


Pathfinding is an important aspect of Computer Science, where iťs a solution that aims to 
solve the shortest path problem in a graph data structure. Pathfinding is the plotting of the 
shortest route between two points. This has various applications in fields such as 
Artificial Intelligence, Network Theory and Game Development. For instance, 
pathfinding can be used in mapping applications like Google Maps to find the best route 
between origin and destination or it can used to send packets across a computer 


network by finding the shortest route between start and end. 


1.1) Personal interest in this essay 


I’m interested in this particular aspect because during my Personal project journey at 
Middle Years Programme, I developed a game, which utilized artificial intelligence. 
However, the Al in my game wasn’t complex because it didn’t have proper pathfinding 
and I wasnt able to implement pathfinding because of my limited understanding then. 


Therefore, I’m using this essay to develop my understanding of pathfinding. 


1.2) Aims of this essay 


Moreover, | explored this essay by investigating how the algorithms perform 
(efficiency) when finding routes on maps. For example, finding the shortest distance 
between two cities. This topic also links to IB Computer Science as it utilized abstract 
data structures and computational thinking. Moreover, | applied my knowledge from 
my course to this essay for instance | used lists and queues in my implementation which 


we learnt in IB Computer Science. 


1.3) The need for research in pathfinding algorithms 


This essay explored the efficiency of the algorithms behind pathfinding by looking at 


certain algorithms which are the Dijkstra’s algorithm and A“ algorithm. The algorithms 
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were implemented because | want to investigate these particular algorithms in a practical 
environment. Consequently, the research question is: How far does Dijkstra’s search 
algorithm compare to A* search algorithm for finding the shortest path in a graph as the 


size ofthe graph increases by number of vertices? 
1.4) Need for efficient algorithms and algorithmic complexity 


This essay investigated the relationship between time taken for an entire operation 
with respect to the increase in size of the relevant dataset. There is a need for this 
research and an important part of algorithms is efficiency. Algoritnms are solved by 
programmers and understanding the efficiency of algorithms is important in programming 
as it allows for growth. Since programmers think in terms of maintaining code in the long 
term. Therefore, creating efficient algorithms is about decreasing the amount of 
recursive operations that is required to complete the task with respect to the size of 
dataset.' Furthermore, this implicates to pathfinding as it’s important to quickly 


calculate the shortest path with larger datasets. 
1.5) Use of pathfinding algorithms in satellite navigation and others 


In the real world, pathfinding algorithms have various uses. For instance, take the internet 
and a pathfinding algorithm could be used in finding the shortest path between a server 
and a node. Another real-world application is in satellite navigation, in which pathfinding 
algorithms could be used to calculate the shortest route from current location to the desired 
destination. Google implements a pathfinding in some form by turning a geographical 
map into a mathematical graph and using Dijkstra’s algorithm using map data to 


find the shortest distance between two places.“ 


1 Choudry, Humzah., 2017. Understanding Algorithm Efficiency And Why It’S Important. [online] Medium. 


2 Lanning, D. R., Harrell, G. K., & Wang, J. (2014). Dijkstra’s algorithm and Google maps. 


1.6) Primary research being used 


For the research | collated data from primarily from my own experiment because it 
would lead to authentic results. The sources for my primary research was a subject expert 
who is a software engineer with 15 years of experience with algorithms in general. The 
subject expert served as guidance“ and consultation for the direction of my research. 


Additionally, | gathered data from a survey to establish the need and verify it. 
1.7) Secondary research in the essay 


The secondary research which is a range of sources includes books, research articles, 
lecture notes and web articles was there to help my understanding and give direction to 
my experiment. Additionally, completing a course on graph theory to build my 
foundational understanding.“ With these many sources, | minimized bias in my research, 


and it gave a balanced review. 


2) Research findings and analysis of both algorithms 


2.1) Graph Theory — Theory behind route planning in satellite navigation"? 


A graph G = {V, Ej is a data structure which is a set of points called vertices (V) (vertices 
are also known as nodes) which are connected by a set of lines called edges (E). In 
essence it is a set of objects (vertices) where there are relationships between pairs of 
objects, for instance a graph of road networks could represent cities (as nodes) and show 


the roads/highways between cities (as edges). Example of a graph is shown below: 


3 See section 5.1 and appendix D 

4 See appendix F 

5 See appendix G 

6 Refer to appendix H 

7 Kulikov, Alexander S. “Introduction to Graph Theory." University of California San Diego, National 
Research University Higher School of Economics. 

8 Quinn, Catherine, et al. Mathematics for the international student: mathematics HL (option) 


This is a graph that | made, where the circles (A, B, C, D, E, F) are the vertices and the 
lines that connect them are the edges. This shows how the objects relate to each other. 
For example, node A is directly connected (adjacent) to node B and node C but there is 


no direct connection with node F. 


The edges can be weighted, meaning it can be assigned a value, for example the length 


of aroad. The graph above shows an unweighted graph. 
Definitions:? 


Path: A path is a series of distinct edges such that each edge (other than initial edge) 
starts with a vertex where the prior edge ended. A simple path is when all vertices are 


distinct. 


9 Appendix A for additional definitions 


SP 


(Self-made) This is a directed graph, where the incident edge of node B adjacent to node A is directed 
as it shows a direction arrow, where you can traverse from A to B but not the other way. Similarly, you 
can traverse from C to A but not from A to C. The incident edge on node B and D are said to be 


undirected and are bidirectional. 


Weight: The weight is a number that’s associated with an edge. The quantity can 
represent anything such as distance, cost and etc. The weight of a path is sum of all edge 
weights from start vertex to end vertex. The shortest path is path with minimum weight 


between two vertices. 


This is a weighted graph, with value assigned next to their respective edges 


2.1.1) Implementation of my data set (graph) 


For my experiment | created graphs using adjacency tables and implemented the graphs 
in the code and calculated heuristics.'° This served as the dataset for my experiment. | 


trained the data to verify if everything in the graph is correct. 
2.2) Shortest Path Problem — Theory behind route planning in satellite navigation" 


The shortest path problem in graph theory is finding a path between two vertices such that 
the weight of the path is minimized. As said earlier the shortest path is the path where the 
sum of all edges between the two vertices are minimum. To find the shortest path, a search 
algorithm needs to be implemented, there are two kinds of search algorithm. Uninformed 
search is when the program has no information about cost from start node to goal node. 
An example of uninformed search algorithm is Dijkstra"s algorithm created by Edsger 
Dijkstra in 1959. Informed search also known as heuristics search are when algorithms 


use heuristics (estimate) using given information in order to find the shortest path. 
2.3) Algorithm Analysis and complexity 


When analyzing algorithms, an important aspect that is to be considered is the 
computational complexity. Computational complexity has two parts, time complexity and 
space complexity. Time complexity is a measure of runtime as the input increases and 
space complexity is measure of memory usage as the input increases.'* This essay will 
only focus on time complexity. Each function in programming has a runtime. Time 
complexity determines how the runtime of the function grows as the number of elements 


in that function grows. For example, if an 1D array is given and a function related to that 


10 See appendix A 
11 Russell, Stuart J, and Peter Norvig. Artificial intelligence: a modern approach. 
12 Sharma, Akash. “Time and Space Complexity Tutorials 8. Notes | Basic Programming." HackerEarth 
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array is given. Thus, the time complexity is the runtime of the function as the number of 


elements in that array increases. 
2.3.1) Need for analyzing algorithms 


The most important reason for analyzing algorithms is to find characteristics in order to 
evaluate the suitability for many applications or in order to compare to a similar algorithm 
used in the same program. The primary goal for analyzing algorithms is to accurately 
predict the performance of algorithms when implemented in programming in order to 
determine the resource usage, set parameters and compare similar algorithms. We use 
the analysis to build mathematical models to describe the performance of real-world 
implementations of algorithms.!“ This leads to us having an informed understanding of the 
particular algorithm and suggest informed improvements. In the real world, programmers 
have to deal with the limitations of resources such as memory and time, there is a need 
for efficient algorithms because programmers keep scalability in mind when implementing 
algorithms into their programs. For instance, in satellite navigation implementing 
pathfinding algorithms is increasing complex compared to a simple graph with a few 


nodes. 
2.3.2) Big O notation and its relation to time complexity 


The big O is a mathematical notation to express time complexity of a function in worst 
case. For example, if a function shows a linear relation between number of elements and 
runtime [f{n)=mn+c] then the big O notation for the time complexity is O(n). O(1) implies 
that there is no change in runtime as number of elements increases (constant time). The 


big O notation only uses terms with the highest degree ignoring any constants. For 


13 Sedgewick, Robert and Philippe Flajolet. An introduction to the analysis of algorithms Second Edition. 
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instance, if the function has a time complexity of 3n“+100n+5 then the time complexity in 


terms of big O is O(n?).'4 
2.4) Explanation and use of Dijkstra"s algorithm in pathfinding 


Dijkstra"s search algorithm is one of the search algorithms that has applications in finding 


the shortest path in a graph. The general algorithm isl!>1161117]: 


1. Let d(v) be the path cost of neighboring vertices v from starting node s. 


2. d(v) = © for a neighboring node as we dont know the value of it vet, moreover d(s) = 0 


as it is the starting node. 


3. From the current vertex check the weight of neighboring nodes and calculate a 
temporary value of d(v). Compare all the values and assign the smallest value. For 
instance, if our current node is node N and with distance of 4 from starting node and the 
edge connecting its unvisited neighbor M has value of 2. Then d(v) will be 4+2 = 6. If M 


was marked with a value greater than 6 then assign it to 6. 


4. After checking all the neighboring nodes from the current node, mark them as visited 


such that those vertices won't be checked again. 
5. If the goal node has been marked visited or if the smallest d(v) = +, then end algorithm. 


Explaining this with an example. Assuming a weighted graph with the starting vertex as A 


and let the goal, vertex be F: 


14 Adamchik, Victor S. Algorithmic Complexity, Carnegie Mellon University, 2009 

15 Cormen, Thomas H. et al. Introduction to Algorithms, 3rd ed. 

16 Pound, Mike. “Dijkstra’s Algorithm - Computerphile.” 

17 Quinn, Catherine, et al. Mathematics for the international student: mathematics HL (option) 


11 


This graph with 6 nodes is weighted and undirected. 


Set each node to infinity for the total distance as they havent been visited vet. Initialize 


the starting distance from start node as 0. 


co 
Node A has been marked as blue for start and node F has been marked yellow for 
end. The numbers in red signify the value of the node that is cumulative from its 
path. For example, A doesn’t have any previous connections thus it will be 0. 
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With the current node check its adjacent nodes, if the current value of node (it will be 
infinity at first) is higher than value at current node plus the value of the connecting edge, 
then update the value of adjacent with current plus weight of the connecting edge and add 


neighbour with the minimum value to become part of shortest distance list. 
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co 
Now node B has been updated from infinity to 3. 


#9 is A >Bas3> 9. 


9/11 
Node D has two paths A -> B -> C -> D at 11 cost and A -> D at9 cost. 


14 


In the current node choose the neighbour with the minimum distance and set it as current 


node. 


Keep iterating this process until all nodes have been marked visited. 


3 bo | 
A 5 
= Le 
D 


9 44 


The algorithm has selected the path with the lower cost (A -> D) after iterating and 
updated the cost of D to 9. 


N 


After all nodes have been visited and the target has been reached, we will get the shortest 


distance from A to F. The shortest distance here is 14. 


15 


9 
The process keeps repeating and all the nodes are known. The algorithm 
finds Node F as highlighted by dark green and completes the program. 
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Below is a pseudocode?? of the Dijkstra search algorithm using this pseudocode and the 


information above | was able to implement this algorithm in python 3.8.19 


m 
Ill 
m 


for allveV- {S} 


do dist[v] - œ 


N[v] - NIL 


while Q # 0 


for all v € neighbors[u) 


do if dist{v] dist[u] + w(u,v 


then distív] + dist[u] + w(u,v 


return dist 


2.5) Explanation and use of A“ algorithm in pathfinding 


AT algorithm works on the same principles as Dijkstra’s algorithm.2° Except it utilizes 
heuristics to make decisions when choosing a path.“ Therefore A“ search is considered 


to be an informed search algorithm. Although it tends to use more memory per node 


18 “Dijkstra Algorithm: Example: Time Complexity." Gate Vidyalay, 2020 

19 See Appendix B which has comments that explain each part of the code 

20 Pound, Mike. “Až (A Star) Search Algorithm - Computerphile.” 

21 Russell, Stuart J, and Peter Norvig. Artificial intelligence: a modern approach. 
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compared to Dijkstra’s. In A“ we assign f(n) a cost evaluation function to all the nodes, 


thus??: 
fm) = gn) + h(n) 


Where f(n) is the total cost of the neighboring node, g(n) is the actual cost to travel from 
current node to the adjacent node and h(n) is a heuristics value from that node to the goal 
node. So, referring to graph in the previous section going from A to Bor A to D will cost 
the weight of the connecting edge plus some assigned heuristics. This would lead to the 
algorithm going down path (A->D->E->F) first before exploring (A->B->C->D) because the 
heuristic value for B would likely be high. This will lead the algorithm to find the path faster. 


For my experimentation | borrowed the code with permission for the A“ search algorithm .?$ 
2.5.1) Heuristics in A“ algorithm 


The heuristics value is mainly arbitrary and dependent on context. For instance, if the 
problem was based on geographical map then a heuristic could be calculated by 
finding the straight-line between two points. A heuristic is considered admissible when 
the estimated heuristics cost is never higher than the actual cost from the current node to 
the goal node. Moreover, if heuristics always underestimate, then A* is guaranteed to find 


a solution.““ 
3) Conducting the experiment 
3.1) Aim of experiment 


This experiment aimed to investigate the efficiency of Dijkstra’s search algorithm and 


AT search algorithm in terms of time complexity by finding the shortest path 


22 Hart, P., Nilsson, N., 8 Raphael, B. (1968). A Formal Basis for the Heuristic Determination of Minimum 
Cost Paths. 

23 Refer to appendix C 

24 Kask, Kalev. Lecture 4: Optimal and heuristic search, 2016 
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between real life cities in a geographical region. The cities were represented as an 
undirected weighted graph with the weight representing the guantity of distance. The 
algorithms were executed in repeated trials and it complete the solution and return the 


shortest path between two cities in a given time. 
3.2) Hypothesis 


My first hypothesis for Dijkstra’s algorithm was as the number of nodes in graph increases 
then the time taken for executing the algoritnm will increase by polynomial order of 2 
because the theoretical time complexity for Dijkstra using a simple list or array is 
O(n?log(n)) where n is the number of nodes . My hypothesis for A“ search algorithm 
was as the number of nodes in graph increases then the time taken would increase 
however it would increase at a slower rate compared to Dijkstra’s because the 
theoretical time complexity which is dependent on optimal heuristics will be 


O(log(n)) meaning it would be linear. 


4) Experiment Methodology 


4.1) Independent variable 


In the experiment for both algorithms the independent variable that was being changed to 
produce a result was the number of nodes (vertices) in graph. | chose to use real life cities 
to be represented as nodes in a graph data structure. Moreover, | used a weighted graph 
where the edge weight was representing the distances between the cities in kilometers. 
After repeated executions of the algorithms, | would adjust the graph by adding cities in 


increments of one. 
4.2) Dependent variable 


The dependent variable that was measured is the time taken for the particular algorithm 


to run in nanoseconds. | chose this particular unit of time because modern computers 
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are extremely fast at making calculations, so to observe the changes | needed to use a 


precise unit of time. This was done to operationalize to see the time complexity of the 


algoritnms empirically. 


4.3) Control Variables 


Variable 


Description 


Justification 


Computer system 


The computer system and 

the OS for each algorithm 

program remained same. 

Specs: 
- 8 gen i7 6 core 

processor 

- 16GB DDR4 
3000Mhz RAM 

- Windows 10 OS 
ver. 10.0.18362 


The system remained 
same for algorithm 
operation time to be not 
affected by hardware such 
as primary memory. 


Same IDE 
Development 
Environment) 


(Integrated 


The programs ran in the 
same IDE. 


Using a different IDE for 
program might affect 
runtimes of the program 


Same graph type 


The graph characteristics 
remained same, meaning 


Features of the graph 
must remain same to give 


algorithms will have the 
same start and end node 
in every case (Berlin and 
Paris). 


EI be undirected, | fairness to the algorithms. 
weighted, and finite 
graphs. 

Same start and end nodes | The graphs that will be | To keep consistency 
traversed by the | between graphs because 


changing the start and end 
node might affect times to 
calculate path. 


4.4) Method 


1. Implemented the Dijkstra’s algorithm and the a“ search algorithms in separate .py files. 


2. Took two European cities, Berlin and Paris as the start and goal node respectively. 
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3. Using mapping software pinned the cities between Berlin and Paris and found distance 


from particular city to Paris (for heuristics values in A“) using longitudes and latitudes.?® 


4. Using the mapping software found the actual distances between cities and using this 
data created 20 different graphs? in excel as a list of pair of cities (one column is the node 


and the second column is the child node and third column is cost between cities). 


5. Configured both algorithms for the particular graph, then ran each algorithm 10 times 


and recorded the execution timings each time it ran for both algorithms. 


6. Repeated step 5 for all 20 graphs (from 4 nodes to 23 nodes that’s 20 datapoints in 


total). 

7. Collated data into excel and calculated averages and made graphs. 
5) Findings of the experiment 

5.1) Results 


Table i. These are the results of mean execution time of Dijkstra’s algorithm in 


[ No.of nodes | = nanoseconds against 
mn NONE Table i. shows the number of nodes in the 
Ld ZUT i 
~~ EEE] graph in the left column and the amount of the number of vertices 
Oooo 6| 32220] time taken to complete search in the right 

column in nanoseconds. For example, in the | 
as 
ee EO Oe Sere te ae graph. 
7317777 37920] Dijkstra"s search took 31000 
nanoseconds. 


25 Appendix A 
26 Refer to Appendix A 
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Table ii. These are the results of mean execution time of A“ algorithm in nanoseconds 


against the number of vertices in graph. 


Table ii. shows the number of nodes in the 


| B| 89240 


graph in the left column and the amount of time 
taken to complete the search algorithm in the 
right column in nanoseconds. For example, in 
the 1* row it says the graph has 4 nodes and 
A* search took 43980 nanoseconds to 
complete. 


A 202690 
DEO 215310 


5 

7 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 


5.2) Graphs of the results 


Graph i. Line graph of table i. 


Execution time in nanoseconds 


375000 
350000 
325000 
300000 
275000 
250000 
225000 
200000 
175000 
150000 
125000 
100000 
75000 
50000 
25000 
0 


0 


1 


Average execution time of Dijkstra againt no. of nodes 


y = 14963@ 10x. 


2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 
No. of vertices in graph 
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Execution time in nanoseconds 


Graph ii. Line graph of table ii 


Average execution time of A" against no. of nodes 
250000 


225000 


200000 Ea 
y = 9130.2x - 19035... 


175000 i 
b Le 


150000 
125000 
100000 ertia ® 
75000 

50000 ee A 


25000 


0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 
No. of vertices in graph 


6) Analyzing the experiment data 


From the data above we can observe that the data from Dijkstra’s algorithm shows a 
positive correlation between the execution timings and the no. of vertices in a graph, same 
with A“ algorithm. Initially, from the data | can observe that the performance of A“ is 
somewhat identical to Dijkstra’s. However, after 19 nodes the difference in performance is 
observed and very noticeable and Dijkstra’s algorithm search time is growing very quickly 
in comparison to A* algorithm. | believe that more data points would be needed to observe 
this quick change more precisely. | applied linear regression model for data in graph ii. to 
check if it fits the linear model. Moreover, | tested the graph i. for exponential growth. 


Additionally, put the values in y-axis on a logarithmic scale (in base 2): 
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Logarithmic scale of graph i. against the number of nodes 


y = 0.1531x + 13.869 


Logarithmic scale of execution time in 
base 2 


0123456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 
Number of vertices in graph 


7) Evaluating the experiment 
7.1) Strengths of the experiments 


A strength of this experiment was the accuracy of the data that is collected. As for each 
value of the independent variable, there were 10 repeated measurements taken (trials). 
This helps with the accuracy of the data because the unit of measurements in 
nanoseconds, which is very small and therefore sensitive, so data would be less concise. 
Thus, it’s balanced out by the repeated trials measurements. Another strength of this 
experiment was the method of measurement for the execution time, a built-in function 
called perf_counter_ns() was used. This is more precise than using the system clock 


to measure time and it returns integer values in nanoseconds. 
7.2) Limitations of the experiments 


| believe that there were limitations to my experiment were that in the first instance, | 
believe the data wasn’t large enough to show the full extent of the algorithms and | 
should've used graphs with 100 or even 1000 nodes. But | wasn’t able to do that because 
| didn’t know how to implement and generate graphs this large. Another limitation, albeit a 
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minor one is that both algorithms don’t have the same implementation of the graph data 
structure. In Dijkstra’s algorithm an adjacency list is used to generate the graph, whereas 
in the A algorithm the graph is created as an object from the class graph. This shouldn’t 
affect the results as much since the time measured is starting from when the search 
algorithm in the main method is executed and not when the graph is generated 


however there should be a level of consistency. 


7.3) Improvements to possible future experiments 


To make improvements for further experimentation. | could start by learning how to 
implement very large graphs using databases, I could use a map application 
programming interface for my database and use it to automatically generate. Although 
it would be very challenging, iťs feasible. The second improvement to experiment would 
be much easier and it would be to implement the algorithms using a consistent data type 


like an adjacency list using a list data type. 


8) Conclusion 


8.1) Summary of the essay 


To conclude this essay, the overall aim of this experiment was to measure the efficiency 
of algoritnms. Efficiency of algorithms is measured by analyzing algorithm complexities. 
This essay focused on time complexity as an aspect of algorithm complexities. Time 
complexity is measure of an algorithm where it shows how the runtime increases 
with respect to the sample size of data that iť s being applied on. In this essay | choose 
pathfinding algorithms as a basis for this because they were complex and had many 
real-life applications. Pathfinding algoritnms are a building block of machine learning and 


Al. Within this essay | explored pathfinding in relation to the shortest path problem Then | 
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recontextualized the shortest path problem in terms of navigation and mapping where in 


my experiment | found the shortest route between two real cities. 


For my experiment | chose Dijkstra’s algorithm which is one of the more popular one in 
pathfinding. Then | chose A“ algorithm which is an extension to Dijkstra except it’s a 
informed search which uses heuristics. In fact, as stated earlier Dijkstra is A* with 
heuristics value of 0 for all nodes. For my heuristics | took the straight-line distance 
between a node to the goal node. My overall hypothesis was that Dijkstra would increase 


at a faster rate in execution time as the size of the graph increases than A“. 


8.2) Conclusion for the findings 


The data from experiment shows them near identical for many graphs however there is a 
noticeable difference for the final few datasets (graphs) and it shows Dijkstra 
quickly growing compared to A*. Thus, to answer the research question there is no 
noticeable difference between the two algorithms in smaller graphs but A“ is generally 


more efficient in terms of timing. 


8.3) Way forward 


This experiment should be replicated for very large graphs to observe the difference in 
more clarity. A future scope for this investigation could be explore the facet of space 
complexity which is another metric for algorithmic complexity which measures how much 
memory is used rather than execution time of an algorithm. Pathfinding is fundamental 
part of Al and it has potential to be utilized in many facets such as machine learning, 


robotics, using Al for mathematical problems, computational biology and medicine. 
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Appendix A — Graphs and implementation 
These graphs serve as the datasets for my experiment 
Additional definitions: 


Adjacent: Adjacent means that when a node is connected to another node by an edge, it 


is said to adjacent to that particular node. 


Degree: Degree of a vertex is the number of Nodes — v Heuristicsvalue — “| 
nodes connected through incident edges. 
Direction: A graph is said to be directed when 
Hanover 
the edges have direction to them, to represent |Kassel 
Munster 
a one-way relationship. An undirected graph is 
Frankfurt 
when the edge incident to two adjacent [Amsterdam 
Cologne 


vertices goes both ways. Strasbourg 399 


Heuristics table of values (values are meant to 


be representative of kilometers): 


For the heuristics in my experiment | 
calculated heuristics between Paris and a city 


using the straight-line distance using 


coordinates in terms longitude and latitude. 


Graph 1 (4 nodes) 


Current node + |Child node |” [Cost (actual) 7 


27 


The table can interpret as showing adjacent nodes to a node and the cost (which is in 
kilometers). For example, Berlin is connected to Cologne by a cost of 559 and it is also 


other way around since it is an undirected graph. 


Graph 2 (5 nodes) 


Current node > [Child node ~ |Cost (actual) v 


Graph 3 (6) 


Graph 4 (7) 


Graph 5 (8) 
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Graph 6 (9) 


Graph 7 (10) 
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Graph 8 (11) 


Berlin 


Graph 9 (12) 


Luxembourg [Verdun | 89 
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Graph 10 (13) 


Luxembourg [Verdun | 89 


Graph 11 (14) 


Frankfurt 
Luxembourg [Verdun | 89 
Verdun 
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Graph 12 (15) 


Luxembourg [Verdun | 89 


Graph 13 (16) 


Berlin 
Luxembourg [Verdun | 89 
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Graph 14 (17) 


Frankfurt 
Luxembourg [Verdun | 89 
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Graph 15 (18) 


Berlin 
Frankfurt 
Luxembourg [Verdun | 89 
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Graph 16 (19) 


Berlin 
Luxembourg [Verdun | 89 
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Graph 17 (20) 


Frankfurt 
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Graph 18 (21) 


Frankfurt 
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Graph 19 (22) 


Luxembourg [Verdun | 89 
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Graph 20 (23) 


Luxembourg [Verdun | 89 


39 


Appendix B - Dijkstra’s algorithm code 


Appendix C — A* algorithm code?” 


Note: | replaced the graphs and heuristics in the main() method with my own data. 


27 Code Taken from AnnyTab: refer to appendix E for permission of code 
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make undirected 


a 


b, dist) 


connect 


.graph_dict. 


aph | 


def astar_search(graph, 


def add to op 


: main() 


Appendix D — Raw data of experiment 


Values in trials columns are in nanoseconds 


Dijkstra’s Algorithm: 


Vertices v [Trial 1  |Trial 2 v [Trial 3 v [Trial 4 v [Trial5  |Trial6 =» |Trial7 + |Trial8 ~ Trial9 + |Trial 10 v 
4 48000 28900 27100 30000 28300 28800 28800 28400 32900 28800 
5 32700 37700 32400 30000 33900 30000 30500 31200 33100 37300 
6 31100 31900 31700 32600 34800 30300 31900 30600 32500 34800 
7 31600 31700 34200 32900 33700 32000 34000 35900 
8 36100 35900 36500 35700 35300 34900 36000 35300 
9 35600 35800 37100 36000 36600 47500 36500 36800 

38400 40100 40000 40900 38000 89800 40100 39200 
41000 45000 41700 41100 40100 43400 43200 47000 
43000 42900 43100 44200 44100 43900 44100 48400 
47700 43600 46400 45600 46100 46600 51100 159000 
47700 87000 46700 46800 47100 48900 46000 48400 
48800 50500 51400 53400 51400 51200 57700 48400 
52000 50900 52100 54700 52100 51500 53100 51400 
110700 98500 97900 102600 104100 109500 99700 102400 
56700 59900 68000 66400 66900 58400 80900 62500 
83000 83200 62000 59500 83900 71900 62400 75700 
20| 104200 84900 95400 117700| 125300 172100 107200 122200 172100 125000 
21| 174200 173800 171100 186400| 105500 133400 174600 177900 177000 175200 
22| 208000 267000 280000 195000| 281800 269400 281300 278000 250800 196600 
23| 402700 420700 311700 306400) 305700 311600 321400 371900 411800 304700) 
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AT algorithm: 


Vertices > [Trial 1 + [Trial itzaz vlTrial4 + [Trial 5 |Trial6 ~|Trial7 ~|Trial8 > [Triat 9 ~ (Trial 10 ~ 
4| 44500 41900| 423001 42800 
5 463000 445001 47200 483001 47600 46800 
e| 49400] sosool 50000! sooool 49500] 49100] 49900] 49100] 48600] 51100 
7] 45900 45800] 48800] 45800 
8[ 52200 52200 528001 52500 
o| 56000) 56200 56200) 573001  57200| 56300/ 57000) 58400 
10| 65600) 68000 68600| 67400]  68100| 68600| 6650o) 67800 
11| sooool 79700 79900| perro sısool 31100 794001 79700 
12 88900] 91000 88100 
13 85700| 86100 84500) 848000 86300 
14 96500] 96400 95900/ 96400] 96300 
15 1035001 102800 103100] 105200] 102800 
16| 111300 111300| 112200] 110300 
17| 76200 74700| 754001 76600 
18 73800] 755001 77800 745001 76500 76400 
19| 1792001 155800] 165600| 1746001 168700| 1670001 169300| 1675001 1696001 172400 

20| 199300 213500| 211100] 204000 
21] 203300 205700| 202300] 198600 
22| 205600] 201100] 206000! 198300] 201100] 2073001 2020001 2059001 198700] 200900 
23| 213200] 219200] 212300] 211800] 216700] 215900} 219300| 215200| 214000| 215500] 
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Appendix E - Permission for the A“ code?® 


| communicated with the owner of this code remotely. 


SID 
February 18, 2021 at 6:28 pm 


Hello there, 


| am student studying in the International Baccalaureate Diploma Program. | am writing this thing called the 
extended essay. It is a research based essay where | take topic in a subject of interest and conduct some sort 
of investigation/analysis. My extended essay is in computer science and in my essay | am analyzing time 


complexities of various pathfinding algorithms in finding the shortest path. 


Would it be okay if | use this code for my essay (I will provide proper references and credits) 


Reply 
ADMINISTRATOR 
February 18, 2021 at 7:05 pm 
Yes, you are more than welcome to use the code in your essay. 
Reply 


28 “A* Search Algorithm in Python." A Name Not Yet Taken AB, 22 Jan. 2020, 
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Appendix E — Evidence of subject expert?? 
Hello si, 
This is a formal email, with questions | have regarding path finding algorithms and time complexity. You can copy these questions and write your response and respond to this email. 


1. Please state who you are and your area of expertise (Le name, profesion, degree & etc) 
Biswajt: this is Biswajit Paria having 15+ yrs experience in advancedJava programming. 


2, What does it exactly mean by time complexity and algorithm complexity in general? 
Biswajit: Time complexity is the part of algorithm complexity. Time Complexity is basically to determine the total unit of ime required to execute an algorithm. 


3. Furthermore, to what extent is time complexity representative of efficiency? 
Biswajit: To maximize the efficiency we need to minimize the use of the resource with the help of measuring time complexity. 


4, How can implementation of an algorithm affect its efficiency? 
Bud it's true that way of implementing algorithm may impact is eficiency. Binary search Tree vs Red Black tree are the great examples of how to make efficient binary search in case when search becomes a linear search. 


5, What are real life applications of path finding algorithms in shortest path problems? 
Biswajit: Google map direction from one place to target location is the real example of shortest path algorithm. 


6. In your opinion, what do you think about the performance of Dijkstra's algorithm in comparison to A" algorithm in finding the shortest? Is there a noticeable difference? Is A’ better than Dijkstra because itis an informed search? 
Biswajit: Yes true. A” ties to look for a better path by using a heuristic function, however it consumes more memory as it does more operations per node. 


Due to COVID-19 pandemic | couldn’t meet the subject expert in person. Therefore, we 


communicated remotely. 


29 Paria, Biswajit. Subject Expert, 2021 
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Appendix G — Survey and results 


Survey guestions 


On a scale of 1to 10, how much do you think pathfinding algorithms will be 
relevant in the future? * 


1 2 3 4 5 6 7 8 9 10 


Not relevant at all ©) O O O O O O O O O Very crucial 


Check the areas of application where you think path finding algorithms are most 
relevant. * 


[] Satellite navigation 
Game development 
Industrial 

Robotics 

Medicine 

Space exploration 
Ocean exploration 
Self driving cars 


Computer networks (like the Internet) 


J 
oO 
J 
o 
J 
o 
J 
E 
J 


Other: 


Do you think we should place importance into research into developing efficient 
algorithms as much as developing new hardware? * 


O ves 
O No 
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Survey results (15 responses): 


On a scale of 1 to 10, how much do you think pathfinding algorithms will be relevant in the 
future? 


15 responses 


0 (0%) 0 (0%) 0 (0%) 0 (0%) 0 (0%) 


Check the areas of application where you think path finding algorithms are most relevant. 


15 responses 


Satellite navigation 
Game development 


Delivery and transportation 


Do you think we should place importance into research into developing efficient algorithms 
as much as developing new hardware? 


15 responses 


@ Yes 
® No 


Appendix H - Course completion evidence 
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Instructors Note 


FA Alexander S. Kulikov 


Welcome to Introduction to Graph Theory! You're joining thousands of learners currently enrolled in the course. I'm excited to have you in 
the class and look forward to your contributions to the learning community. 


To begin, | recommend taking a few minutes to explore the course site. Review the material well cover each week, and preview the 
assignments you'll need to complete to pass the course. Click Discussions to see forums where you can discuss the course material with 


Week 4 


Graph Parameters 


Videos O) Done 
Readings O) Done 


Other O Done 


Week 1 


What is a Graph? 


videos O Done 
Readings ® Done 


Other O Done 


ase post them 
earner Help C Week 5 


joy the course! 


Flows and Matchings 


Videos O Done 
Readings © Done 


Practice Exercises O Done 


Week 2 Week 3 


Cycles Graph Classes 


Videos O) Done videos O Done 
Readings © Done Readings © Done 
Other O Done 


Practice Exercises © Done 


Other © Done 


51 


Works Cited 


“A* Search Algorithm in Python” A Name Not Yet Taken AB, 22 Jan. 2020, 


www.annytab.com/a-star-search-algorithm-in-python. 


Choudry, Humzah., 2017. Understanding Algorithm Efficiency And Why Its’s Important. 
[online] Medium. <https://medium.com/@humzah.choudry/understand-algorithm- 


efficiency-and-why-its-important-89df0d5dfb64>. 


Cormen, Thomas H. et al. Introduction to Algorithms, 3rd ed., MIT Press, 2009, pp. 589— 


658. 


“Dijkstra Algorithm: Example: Time Complexity.” Gate  Vidyalay, 2020, 


www.gatevidyalay.com/tag/dijkstra-algorithm-pseudocode/. 


Hart, P., Nilsson, N., & Raphael, B. (1968). A Formal Basis for the Heuristic Determination 
of Minimum Cost Paths. IEEE Transactions on Systems Science and Cybernetics, 4(2), 


100-107. doi:10.1109/tssc.1968.300136 


Kask, Kalev. Lecture 4: Optimal and heuristic search, lecture notes, ICS 271, University of 


California, Irvine, 2016. 


Kulikov, Alexander S. “Introduction to Graph Theory." University of California San Diego, 
National Research University Higher School of Economics. Coursera. Online Lecture 


<https://www.coursera.org/learn/graphs> 


Lanning, D. R., Harrell, G. K., & Wang, J. (2014). Dijkstra"s algorithm and Google maps. 
Proceedings of the 2014 ACM Southeast Regional Conference on - ACM SE 


"14. doi:10.1145/2638404.2638494 


52 


Paria, Biswajit. Subject Expert, 2021 


Pound, Mike. “A* (A Star) Search Algorithm - Computerphile.” YouTube, uploaded by 


Computerphile, 15 Feb. 2017, www.youtube.com/watch?v=ySN5Wnu8ß8nE. 


Pound, Mike.  “Dijkstra’s Algorithm - Computerphile.” YouTube, uploaded by 


Computerphile, 4 Jan. 2017, www.youtube.com/watch?v=GazC3A40QTE. 


Russell, Stuart J, and Peter Norvig. Artificial intelligence: a modern approach. Englewood 


Cliffs, N.J: Prentice Hall, 1995. Print. 
Adamchik, Victor S. Algorithmic Complexity, Carnegie Mellon University, 2009 


Sedgewick, Robert and Philippe Flajolet. An introduction to the analysis of algorithms 


Second Edition. Addision-Wesley, 2013. 


Sharma, Akash. “Time and Space Complexity Tutorials & Notes | Basic Programming.” 
HackerEarth, 30 Aug. 2016, www.hackerearth.com/practice/basic- 


programming/complexity-analysis/time-and-space-complexity/tutorial. 


Quinn, Catherine, et al. Mathematics for the international student : mathematics HL 
(option) : discrete mathematics, HL topic 10, FM topic 6, for use with IB diploma 


programme. Marleston, SA: Haese Mathematics, 2014. Print. 


53 


